XML-based data integration
نویسنده
چکیده
Today’s fast and continuous growth of large business organizations enforces an increasing need in integrating and sharing large amounts of data, coming from a number of heterogeneous and distributed data sources. Thus, during the last decade, research and business community interest has migrated from DataBase Management Systems (DBMS) to data integration systems. Whereas the former make a unique local data source accessible through a schema, the latter offer the necessary framework to combine the data from a set of heterogeneous and autonomous sources through a socalled global schema. We call such a system a hierarchical data integration system. Today, one more step is being achieved towards so-called Peer-to-Peer (P2P) data integration systems, which are characterized by an architecture constituted by various autonomous nodes which hold information, and are linked to each other by means of mappings. Another technical progress, somehow orthogonal to the aforementioned one, is the one that concerns with the impressive growth of Internet and the consequent proposal of the XML standard for the exchange of data on the Web. Our goal is to develop a formal framework that captures all the issues related to hierarchical data integration systems, whose global schema is expressed by means of an XML schema language. In particular, we will start by considering the case of a global schema specification given by means of a DTD and a set of XML integrity constraints. Moreover, we plan to investigate issues related to XML-based P2P data integration systems. Our research will be done in joint supervision of the department of Computer Science of the University of Rome ”La Sapienza” and the GEMO Project, born from the merging of INRIA-Rocquencourt Verso Project and the IASI group of the University of Paris-Sud.
منابع مشابه
Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML
As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...
متن کاملApply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML
As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...
متن کاملGrid Data Integration Based on Schema Mapping
Data integration is the flexible and managed federation, analysis, and processing of data from different distributed sources. Data integration is a key issue for exploiting the availability of large, heterogeneous, distributed and highly dynamic data volumes on Grids. This paper presents a framework for integrating heterogeneous XML data sources distributed among the nodes of a Grid. We present...
متن کاملOntology-based heterogeneous XML data integration
In this paper we present an ontology-based method for formalizing the implicit semantic and we suggest mechanisms to semantically integrate XML schemas and documents as well. After a survey of database interoperability, we present our semantic integration approach by explaining the nature of ontology. The article then presents our integration method for XML data and schemas using a generic onto...
متن کاملQuery rewriting for open XML data integration systems
This paper presents OpenXView, a model for open XML data integration systems, characterized by the autonomy of users that publish XML data on a common topic. Autonomy implies frequent and unpredictable changes to data and a high degree of structure heterogeneity. The OpenXView model provides an original integration schema, based on a hybrid ontology XML schema structure. We propose solutions fo...
متن کاملXML Data Integration by Graph Restructuring
This paper describes the integration of XML data sources within the AutoMed heterogeneous data integration system. The paper presents a description of the overall framework, as well as an overview of and comparison with related work and implemented solutions by other researchers. The main contribution of this research is an algorithm for the integration of XML data sources, based on graph restr...
متن کامل